AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
High-Performance Inference

# High-Performance Inference

Xlangai Jedi 7B 1080p GGUF
Apache-2.0
This is a Jedi - 7B - 1080p model quantized using llama.cpp, offering multiple quantization types for users to choose from, balancing file size and model quality.
Large Language Model English
X
bartowski
225
1
Qwen3 235B A22B Mixed 3 6bit
Apache-2.0
This is a mixed 3-6bit quantized version converted from the Qwen/Qwen3-235B-A22B model, optimized for efficient inference on the Apple MLX framework.
Large Language Model
Q
mlx-community
100
2
Qwen2.5 Recursive Coder 14B Instruct
Apache-2.0
A 14B-parameter code generation and comprehension model based on the Qwen2.5 architecture, integrated through the Model Stock method by combining multiple specialized coding models
Large Language Model Transformers
Q
spacematt
39
2
Mixtral 7b 8expert
Apache-2.0
The latest Mixture of Experts (MoE) model released by MistralAI, supporting multilingual text generation tasks
Large Language Model Transformers Supports Multiple Languages
M
DiscoResearch
57.47k
264
Yi 6B
Apache-2.0
Yi-34B-Chat is a bilingual large language model developed by 01.AI, ranking second only to GPT-4 Turbo on the AlpacaEval leaderboard with outstanding performance.
Large Language Model Transformers
Y
01-ai
17.03k
372
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase